Promise and pitfalls of extending Google's PageRank algorithm to citation networks.

نویسندگان

  • Sergei Maslov
  • Sidney Redner
چکیده

The misuse of journal impact factor in hiring and promotion decisions is a growing concern. This article is one in a series of invited commentaries in which authors discuss this problem and consider alternative measures of an individual's impact. The number of citations is the most commonly used metric for quantifying the importance of scientific publications. However , we all have anecdotal experiences that citations alone do not characterize the importance of a publication. Some of the shortcomings of using citations as a universal measure of importance include the following. (1) It ignores the importance of citing papers: a citation from an obscure paper is given the same weight as a citation from a groundbreaking and highly cited work. (2) The number of citations is ill suited to compare the impact of papers from different scientific fields. Due to factors such as size of a field and disparate citation practices, the average number of citations per paper varies widely between disciplines. An average paper is cited ϳ6 times in life sciences, 3 times in physics, and Ͻ1 times in mathematics. (3) Many groundbreaking older articles are modestly cited due to a smaller scientific community when they were published. Furthermore, publications on significant discoveries often stop accruing citations once their results are incorporated into textbooks. Thus, citations consistently underestimate the importance of influential old papers. These and related shortcomings of citation numbers are partially obviated by Google's PageRank algorithm (Brin and Page, 1998). As we shall discuss, PageRank gives higher weight to publications that are cited by important papers and also weights citations more highly from papers with few references. Because of these attributes , PageRank readily identifies a large number of scientific " gems " : modestly cited articles that contain groundbreaking results. In a recent study (Chen et al., 2007), we applied Google's PageRank to the citation network of the premier American Physical Society (APS) family of physics journals (Physical Review A–E, Physical Review Letters, Reviews of Modern Physics, and Physical Review Special Topics). Our study was based on all 353,268 articles published in APS journals since their inception in 1893 until June 2003 that have at least one citation from within this dataset. This set of articles has been cited a total of 3,110,839 times by other APS publications. Our study is restricted to internal citations— citations to APS articles from other APS articles. Other studies (Bollen et …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ArticleRank: a PageRank-based alternative to numbers of citations for analysing citation networks

Structured Abstract. Purpose: The paper suggests an alternative to the widely used Times Cited criterion for analysing citation networks. The approach involves taking account of the natures of the papers that cite a given paper, so as to differentiate between papers that attract the same number of citations. Method: ArticleRank is an algorithm that has been derived from Google's PageRank algori...

متن کامل

Competitive economy as a ranking device over networks

We propose a novel approach to generating a ranking of items in a network (e.g., of web pages connected by links or of articles connected by citations). We transform the network into an exchange economy, and use the resulting competitive equilibrium prices of the network nodes as their ranking. The widely used Google's PageRank comes as a special case when the nodes are represented by Cobb-Doug...

متن کامل

Applying weighted PageRank to author citation networks

This paper aims to identify whether different weighted PageRank algorithms can be applied to author citation networks to measure the popularity and prestige of a scholar from a citation perspective. Information Retrieval (IR) was selected as a test field and data from 1956-2008 were collected from Web of Science (WOS). Weighted PageRank with citation and publication as weighted vectors were cal...

متن کامل

PageRank for ranking authors in co-citation networks

This paper studies how varied damping factors in the PageRank algorithm influence the ranking of authors and proposes weighted PageRank algorithms. We selected the 108 most highly cited authors in the information retrieval (IR) area from the 1970s to 2008 to form the author co-citation network. We calculated the ranks of these 108 authors based on PageRank with the damping factor ranging from 0...

متن کامل

The Evaluation of the Team Performance of MLB Applying PageRank Algorithm

Background. There is a weakness that the win-loss ranking model in the MLB now is calculated based on the result of a win-loss game, so we assume that a ranking system considering the opponent’s team performance is necessary. Objectives. This study aims to suggest the PageRank algorithm to complement the problem with ranking calculated with winning ratio in calculating team ranking of US MLB. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • The Journal of neuroscience : the official journal of the Society for Neuroscience

دوره 28 44  شماره 

صفحات  -

تاریخ انتشار 2008